Klasifikasi Berita Menggunakan Metode K-Nearest Neighbor

نویسندگان

چکیده

Abstrak - Meningkatnya minat masyarakat dalam mengakses berita, khususnya berita online, menuntut redaktur dan situs portal untuk memberikan liputan yang berkualitas. Selain itu, klasifikas ada masih tergolong umum dapat menjadi kendala dialami pembaca. jika pembaca ingin melihat kategori lebih spesifik, mereka harus menyaring tersebut secara manual. Hal ini juga terjadi di bidang sosial Badan Pusat Statistik Provinsi Riau kesulitan mencari tentang Riau. Oleh karena proses klasifikasi menggunakan metode k-nearest neighbor hal krusial dilakukan. Jumlah digunakan penelitian berjumlah 510 data dengan tiga yaitu demokrasi, kemiskinan, ketenagakerjaan. Proses meliputi: pengumpulan data, pelabelan manual, preprocessing teks, pembobotan kata, memakai neighbor. cosinus similarity meningkatkan nilai akurasi. Nilai akurasi tertinggi diperoleh pada adalah 87% k = 3 distribusi uji 20% latih dari 80%. Dari diambil kesimpulan bahwa K-Nearest Neighbor bekerja baik berita.Kata kunci: Statistik, Berita, Cosine Similarity, Klasifikasi, Abstract The increasing of public interest in accessing news, especially online requires editors and news sites to provide quality coverage news. In addition, the grouping that still classified as a general can be an obstacle experienced by readers. if reader wants see more specific category they must filter manually. This is also happened social sector Riau, which has trouble when finding about Province. Therefore, classification process using method crucial thing do. number stories used this study amounted with three categories, democracy, poverty, employment. includes: collection, manual labeling, text preprocessing, word weighting, method. Besides that, cosine increase accuracy value. highest values obtained were distribution test training From research, it concluded works well process.Keywords: Classification, Neighbor, News

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Drought Monitoring and Prediction using K-Nearest Neighbor Algorithm

Drought is a climate phenomenon which might occur in any climate condition and all regions on the earth. Effective drought management depends on the application of appropriate drought indices. Drought indices are variables which are used to detect and characterize drought conditions. In this study, it was tried to predict drought occurrence, based on the standard precipitation index (SPI), usin...

متن کامل

Fast Approximate Nearest-Neighbor Search with k-Nearest Neighbor Graph

We introduce a new nearest neighbor search algorithm. The algorithm builds a nearest neighbor graph in an offline phase and when queried with a new point, performs hill-climbing starting from a randomly sampled node of the graph. We provide theoretical guarantees for the accuracy and the computational complexity and empirically show the effectiveness of this algorithm.

متن کامل

Unsupervised K-Nearest Neighbor Regression

In many scientific disciplines structures in highdimensional data have to be found, e.g., in stellar spectra, in genome data, or in face recognition tasks. In this work we present a novel approach to non-linear dimensionality reduction. It is based on fitting K-nearest neighbor regression to the unsupervised regression framework for learning of low-dimensional manifolds. Similar to related appr...

متن کامل

Neighbor-weighted K-nearest neighbor for unbalanced text corpus

Text categorization or classification is the automated assigning of text documents to pre-defined classes based on their contents. Many of classification algorithms usually assume that the training examples are evenly distributed among different classes. However, unbalanced data sets often appear in many practical applications. In order to deal with uneven text sets, we propose the neighbor-wei...

متن کامل

Evolving edited k-Nearest Neighbor Classifiers

The k-nearest neighbor method is a classifier based on the evaluation of the distances to each pattern in the training set. The edited version of this method consists of the application of this classifier with a subset of the complete training set in which some of the training patterns are excluded, in order to reduce the classification error rate. In recent works, genetic algorithms have been ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Jurnal nasional komputasi dan teknologi informasi

سال: 2022

ISSN: ['2620-8342', '2621-3052']

DOI: https://doi.org/10.32672/jnkti.v5i2.4192